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Optimal  Strategies  for  Maximum-Number  Games 
by 

Milton  Ash  and  Wayne  Jones 

Recently  the  following  problem,  ^  which  is  but  one  of  a  genre  of  like 
problems was  posed:  Given  N  different  real  numbers  (e.g.,  N=1000)  on 
cards  face  down  on  a  table,  what  is  the  stopping  rule  so  that  the  probability 
of  choosing  the  largest  number  is  maximized?  The  additional  rules  of  the  gome 
are:  (a)  that  as  the  numbers  are  chosen,  the  cards  are  turned  up  so  that  a 
comparison  of  the  discards  can  be  made  with  the  immediate  choice,  and  (b) 
that  only  the  immediate  choice  can  be  declared  to  be  largest. 

First  define  two  state  subscripts:  State  k  >  :  a  choice  has  just  been 
made,  with  k  of  H  choices  remaining,  and  this  immediate  choice  happens  to  be 
the  largest  of  those  chosen  thus  far.  State  k  <  :  ditto,  except  that  this 
immediate  choice  doesn't  happen  to  be  the  largest  of  those  chosen  thus  far. 

Let  be  the  probability  oi  being  in  state  k  >  and,  if  one  stops.,  being 

declared  the  winner,  i.e.,  holding  the  largest  of  the  N  numbers.  P^  is 
defined  in  the  same  way  for  the  state  k  <.  Then  the  following  stopping  rule, 
"GO  until  N-k  are  chosen;  then  stop  if  in  state  k  >, "  which  maximizes  P.  y 

O  O  K  ^ 

O 

can  be  obtained  from  the  following  two  equations  of  state.  These  are  written 
by  applying  the  principle  of  optimality  of  dynamic  programming,  and  are  given 
by: 
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pk>  ’  ”6* 


If  STOP: 


If0°  :  >  +  (1- < 


'  p0>  =  1 


If  STOP:  0 

P  =  max  .  , 

k  ifG° :  y^>pk-i> + (i*  N^)pk.i< 


*  P(X  =  ° 


If  one  begins  to  choose  numbers,  k  >  kQ,  the  "if  GO"  >  "if  STOP"  quantities 

on  the  right  side  of  equation  (l)  will  hold,  until  for  some  k=k  "if  GO"  * 

o 

"If  STOP."  For  k  <  kQ,  "if  GO  <  If  STOP."  The  stopping  rule  is  to  GO  until 
N-kQ  numbers  are  chosen.  If  in  state  kQ  >,  then  STOP.  If  not,  keep  choosing 
until  the  first  k  >  state  is  reached  for  k  <  kQ,  then  STOP.  Again,  note  that 
k  designates  the  choices  remaining. 

To  find  kQ,  equation  (1)  at  k=kQ,  which  is  the  turning  point  in  the  choice 
sequence  from  GO  to  STOP  as  k  decreases,  yields 


1  "  N°  =  N+l-kPk  -J>  +  ^1“  H+I-k  )Pk  -1< 
00  00 


Eliminating  P.  by  iteration  using  equation  (2)  gives 

V 

p  V1 

(4)  1  *  T  =  N+l-k  +  (N-ko)  U  (N-jj(N+l-j) 


N+j-kr 


Using  the  fact  that  P^  _.>  »  — jj — -  ,  kQ  is  the  (smaller)  root  of 
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( 


(5) 


f 

j-i 


1 

N 


If  N  is  large  (like  1000 ),  i  above  is  replaced  by  zero,  and  kQ  is  given  to 
good  approximation  by 


1 


Then  kQ-  (l-e-1)N  and  N-kQ  =  N/e  »  O.368N.  The  stopping  rule  for  N-1000  is  to 

discard  randomly,  stopping  at  the  368th  number  if  it  is  the  largest  of  the 

discards  thu6  far.  If  not,  choose  the  first  number  after  the  368th  that  is 

the  largest  and  then  stop.  P.  ^  =  e"\  From  equations  (l)  and  (2),  for 

Ko 

k  >  kQ  and  (GO),  P^  =  P^.  This  implies  P^  >  =  P^^  =  ...  =  P^.  Then  the 

0  _i 

probability  of  winning  with  the  above  stopping  rule  is  e  . 

For  an  added  fillip,  assume  that  it  costs  C  dollars  per  choice,  and  that 

picking  the  largest  number  x  yields  a  prize  in  the  amount  of  x  dollars  plus 
B  B 

a  bonus  of  -  dollars  (or  an  additional  cost  of  -  dollars  corresponding  to  a 
house  cut).  Then  what  is  the  stopping  rule  to  maximize  the  expectation  of 
winning? 

lhe  upper  right  side  of  (l)  now  reads:  If  STOP:  (^j^)(x  +  2  _  nc).  The 

g 

rest  remains  the  same,  giving,  for  large  N,  k  =  [  1-exp- (x  +  -  -  NC)]N, 
which  reduces  to  the  old  game  for  x*l,  B=C=0.  This  has  obvious  interesting 
ramifications  for  various  sizes  of  the  largest  number  x,  ante  C,  bonus 
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constant  +  B,  and  the  total  number  of  cards  N.  For  example,  if  x  is  very 
large  or  small  compared  to  NC,  and  if  B  >  0  and  such  that  x  +  —  -  NC  is 
alvays  positive,  then  kQ  -*  N,  so  that  it  pays  to  stop  early. 


(1)  B.  H.  Bissinger,  C.  Siegel,  American  Mathematical  Monthly,  Vol.  70, 

No.  3,  p.  336,  March  1963. 

(2)  M.  Sakaguchi  "Dynamic  Programming  of  Some  Sequential  Sampling  Design, " 

Jo.  Math.  Analysis  and  Applications,  Vol.  2,  No.  3*  PP-  kU6-k66,  June  1961. 

(3)  6.  Gluss  "A  Note  on  a  Computational  Approximation  to  the  Two  Machine 
Problem, "  Jo.  Information  and  Control,  Vol.  1,  pp.  268-275,  1957* 


« 


UICLA8S27XKD 


Systen  Development  Corporation, 

Santa  Monica,  California 
OPTIMAL  8®ATKHB5  FOR  MAXIMUM- 
mJMBKR  OAKES. 

Soiantlfie  rapt.,  SP- 1216/000/00, 
bgr  M.  Ash,  W.  Jones.  22  May  1963, 

Up.,  3  refs. 

Unclassified  report 
DESCRIPTORS:  Mathematical  Analysis. 


UNCLASSIFIED 


